
Analysis of gradient descent methods with non-diminishing, bounded errors


Abstract

The main aim of this paper is to provide an analysis of gradient descent (GD) algorithms with gradient errors that do not necessarily vanish, asymptotically. In particular, sufficient conditions are presented for both stability (almost sure boundedness of the iterates) and convergence of GD with bounded, (possibly) non-diminishing gradient errors. In addition to ensuring stability, such an algorithm is shown to converge to a small neighborhood of the minimum set, which depends on the gradient errors. It is worth noting that the main result of this paper can be used to show that GD with asymptotically vanishing errors indeed converges to the minimum set. The results presented herein are not only more general when compared to previous results, but our analysis of GD with errors is new to the literature to the best of our knowledge. Our work extends the contributions of Mangasarian & Solodov, Bertsekas & Tsitsiklis and Tadic & Doucet. Using our framework, a simple yet effective implementation of GD using simultaneous perturbation stochastic approximation (SPSA), with constant sensitivity parameters, is presented. Another important improvement over many previous results is that there are no `additional' restrictions imposed on the step-sizes. In machine learning applications where step-sizes are related to learning rates, our assumptions, unlike those of other papers, do not affect these learning rates. Finally, we present experimental results to validate our theory.
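To make the setting concrete, below is a minimal sketch of GD driven by SPSA estimates with a constant sensitivity parameter, the regime the abstract describes: keeping the perturbation size fixed leaves a bounded, non-diminishing bias in the gradient estimates. The function names, the step-size schedule a_n = 1/(n+1), and the quadratic test problem are illustrative assumptions, not the paper's actual experiments.

```python
import numpy as np

def spsa_gradient(f, x, c, rng):
    """Two-sided SPSA gradient estimate with a *constant* sensitivity
    parameter c. With c fixed, the estimate carries a bounded bias that
    does not vanish asymptotically -- the error regime analyzed here."""
    delta = rng.choice([-1.0, 1.0], size=x.shape)  # Rademacher perturbation
    return (f(x + c * delta) - f(x - c * delta)) / (2.0 * c * delta)

def gd_with_spsa(f, x0, steps=5000, c=0.1, seed=0):
    """Gradient descent using the biased SPSA estimates. The step-sizes
    a_n = 1/(n+1) satisfy the standard conditions (sum a_n = inf,
    sum a_n^2 < inf); no additional restrictions are imposed on them."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    for n in range(steps):
        a_n = 1.0 / (n + 1)
        x = x - a_n * spsa_gradient(f, x, c, rng)
    return x

if __name__ == "__main__":
    # Quadratic test problem whose minimum set is the point (1, -2).
    f = lambda x: (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2
    x_final = gd_with_spsa(f, x0=[5.0, 5.0])
    # The iterates remain bounded and settle in a small neighborhood of
    # the minimum set, whose size depends on the bounded gradient error.
    print(x_final)
```

In line with the stated result, the iterates do not converge to the minimizer exactly; shrinking c (i.e., letting the gradient error vanish asymptotically) recovers convergence to the minimum set itself.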
